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Abstract 

We consider the MIMO wiretap channel, that is a MIMO broadcast 
channel where the transmitter sends some confidential information to one 
user which is a legitimate receiver, while the other user is an eavesdropper. 
Perfect secrecy is achieved when the the transmitter and the legitimate 
receiver can communicate at some positive rate, while insuring that the 
eavesdropper gets zero bits of information. In this paper, we compute the 
perfect secrecy capacity of the multiple antenna MIMO broadcast channel, 
where the number of antennas is arbitrary for both the transmitter and 
the two receivers. 



1 Introduction 

Security in wireless communication is a critical issue, which has recently at- 
tracted a lot of interest. By nature, wireless channels offer a shared medium, 
particularly favorable to eavesdropping. Among the numerous points of view 
from which security has been investigated, we adopt here the one of informa- 
tion theoretic security. In this context, most of the works dealing with wireless 
communication are based on the seminal work of Wyner [16j , and its model, the 
wire-tap channel. 

1.1 Information theoretic confidentiality 

In a traditional confidentiality setting, a transmitter (Alice) wants to send some 
secret message to a legitimate receiver (Bob), and prevent the eavesdropper 
(Eve) to have knowledge of the message. 

From an information theoretic point of view, the communication channel 
involved can be modeled as a broadcast channel, following the wire-tap channel 
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model introduced by Wyner [TB]: a transmitter broadcasts its message, say 
G W*^, encoded into a codeword x", and the two receivers (the legitimate 
and the illegitimate) respectively receive and 2;", the output of their channel. 
The knowledge that the eavesdropper gets of w'^ from its received signal z" is 
modeled by 

/(z»;U,fe) = ;i(iy'=)-/l(ty'=|z"), 

since the mutual information measures the amount of information that z" con- 
tains about The notion of perfect secrecy captures the idea that whatever 
are the resources available to the eavesdropper, they will not allow him to get 
a single bit of information. Perfect secrecy thus requires 

/(z";w'=)=0 = /i(w'=|z"). 

In other words, the amount of randomness is the same in or in w''\z". 

The decoder computes an estimate li'^ of the transmitted message w'', and 
the probability Pe of decoding erroneously is given by 

Pe = Pr{w^ 7^ w''). (1) 

The amount of ignorance that the eavesdropper has about a message w'^ is called 
the equivocation rate, and following the above discussion, it is naturally defined 
as: 

Definition 1 The equivocation rate Re at the eavesdropper is 

= -/i(w'=|z"), 
n 

with < i?e < h{w^)/n. Clearly, if is equal to the information rate h{w^)/n, 
then I{z'^\w'^) = 0, which yields perfect secrecy. 

To perfect secrecy is associated a perfect secrecy rate Rg , which is the amount 
of information that can be sent not only reliably but also confidentially, with 
the help of a (2"-'^= , n) code. 

Definition 2 A perfect secrecy rate Rs is said to be achievable if for any e > 0, 
there exists a sequence o/(2"^%n) codes such that for any n > n{e), we have 

Pe < e (2) 
Rs~e < Re- (3) 

The first condition ([2]) is the standard definition of achievable rate as far as 
reliability is concerned. The second condition ([3]) guarantees secrecy, up to the 
equivocation rate, which we will require to be h{w^)/n to have perfect secrecy. 
The secrecy capacity is defined similarly to the standard capacity: 

Definition 3 The secrecy capacity Cg is the maximum achievable perfect se- 
crecy rate. 
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1.2 Previous work 



In his seminal work [TQ, Wyner showed for discrete memoryless channels that 
the perfect secrecy capacity is actually the difference of the capacity of the two 
users. To prove this result, he worked under the assumption that the channel of 
the eavesdropper is a degraded version of the channel of the legitimate receiver. 
This result has been generalized to Gaussian channels by Leung et al. [7] , under 
the same assumption. 

The wire-tap channel has been adopted as a model for numerous works on 
information theoretic security, and in particular for those on fading channels, 
both for point-to-point and multi-user systems. We mainly review the prior 
work for point-to-point. In [5], Gopala et al. have shown that the secrecy ca- 
pacity is also the difference of the two capacities in the case of a single antenna 
fading channel, under the assumption of asymptotically long coherence inter- 
vals, when the transmitter either knows both channels or only the legitimate 
channel. When only the legitimate channel is known, an optimal power allo- 
cation is given, using a variable rate transmission scheme. In [I], Barros et al. 
have characterized information theoretic security in terms of outage probabil- 
ity. In the case when the transmitter does not know the eavesdropper channel, 
they define the probability of transmitting at a secrecy rate Rs bigger than the 
secrecy capacity Cs (i.e. the outage probability) as the probability that the 
information theoretic security is compromised. They compute this probability, 
and also show that the probability that the secrecy capacity Cs is positive can 
actually be positive even if the average SNR of the legitimate channel is weaker 
than the one of the eavesdropper. They extend their work in [2], where they 
also consider the cases when Alice has either imperfect or perfect knowledge of 
the eavesdropper channel. Independently, Liang et al. [H] and Li et al. [TU] 
have computed the secrecy capacity for the parallel wiretap channel with inde- 
pendent subchannels, and derived optimal source power allocation. The secrecy 
capacity of the wiretap channel with single antenna fading channel follows. Fi- 
nally, the results of [12] are extended in [13], where a fading broadcast channel 
with confidential messages is considered, with common information for two re- 
ceivers, and confidential information intended for only one receiver. The secrecy 
capacity is computed for the parallel broadcast channel with both independent 
and degraded subchannels. 

In this work, we are interested in the perfect secrecy capacity of multiple 
antenna channels. A first study of the problem has been proposed by Hero 
[S] . In a different context than the wire-tap channel, he introduced the so-called 
constraints of low probability of detection, and low probability of intercept, con- 
sidering the scenario where the transmitter and the receiver arc both informed 
about their channel while the eavesdropper is uniformed about his. In 9 , the 
SIMO wiretap channel has been considered. Several results on the secrecy in 
MIMO communication have been provided very recently. In [11] . the secrecy 
capacity is computed for the MISO case. Furthermore, a lower bound is com- 
puted in the MIMO case. This lower bound, that is the achievability, is shown 
to be the expected result, namely, the difference of the two channel capacities. 
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like in the previous cases. Finally, the secrecy capacity for the MISO case has 
been proven independently by Khisti et al. [6j, where furthermore an upper 
bound is given for the MIMO case, in a regime asymptotic in SNR. 

The contribution of this paper is to compute the perfect secrecy capacity 
of the multiple antenna wire-tap channel, for any number of transmit /receive 
antennas, as well as for any SNR regime. One of the difficulties in studying the 
MIMO wire-tap channel is that the broadcast MIMO channel is not degraded, an 
assumption which is crucial in the proof of the converse in the original paper by 
Wyner (as well as in the proofs presented in [71 [51 [TJ [T2] ) . In order to compute 
the secrecy capacity, we provide a proof technique for the converse, which is 
different than the original one, and allows us to deal with channels that are not 
degraded. Note that our result shows that the inner bound by Li et al. [11] is 
tight, and this is proved by the computation of an upper bound that actually 
matches the lower bound. 

1.3 The MIMO wiretap channel 

We consider the MIMO wiretap channel, that is, a broadcast channel where the 
transmitter is equipped with n transmit antennas, while the legitimate receiver 
and an eavesdropper have respectively um and he receive antennas. Thus, our 
model is described by the following broadcast channel 

Y = HmX + Vm 
Z = HeX + Ve 

where Y, Vm and Z, Ve are respectively um x 1 and n^; x 1 vectors. The notation 
that we will use throughout the paper is that the subscript M refers to the main 
channel (the one of the legitimate receiver), while the subscript E refers to the 
eavesdropper channel. We will denote by I„ the n x n identity matrix, and by 
0„ the n X n all zero matrix. We may omit the subscript if the dimension is 
obvious. 

We make the following assumptions: 

• X is the n X 1 transmitted signal, with covariancc matrix Kx ^ 0„ satis- 
fying the power constraint 

Tt{Kx) = P. 

The power constraint holds for the whole paper, and we may sometimes 
omit to repeat it explicitly. 

• Hm and He are respectively um x n and ue x n fixed channel matrices 
such that 

HIjHm >~ On, He^e >- 0„. 
They are assumed to be known at the transmitter. 
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• VmjVe are independent circularly symmetric complex Gaussian vectors 
with identity covariance Km — I-um j ^"^e — Ike and independent of the 
transmitted signal X . 

Theorem 1 The secrecy capacity of the MIMO wiretap channel is given by 

Cs = max logdet(I + HmKxHIi) - logdet(I + HeKxH*^) 

Kx f 

where Tt{Kx) = P- The paper contains the proof of the above theorem: in 
Section [21 we prove an achievability result which characterizes the optimal ma- 
trices Kx, while Section [3] contains the main results, namely the proof of the 
converse. 

2 On the Achievability 

In this section, we state the achievability part of the secrecy capacity, and further 
prove that in the non-degraded case, the achievability is maximized by n x n 
matrices Kx which are low rank, that is of any rank r < n. 

Proposition 1 The perfect secrecy rate 

Rs = max log det(I + HmKxHIi) - log det(I + HeKxH^) 

KxhO,Tr(Kx)=P 

is achievable. 

This has already been proved [TT]. In fact, the interpretation is obvious. When 
Kx is chosen, the difference between the resulting mutual informations to the 
legitimate user and eavesdropper can be secretly transmitted. 

Proposition 2 Let Kx be an optimal solution to the optimization problem 

ma^Kx logdet(I + HmKxHIj) - logdet(I + HeKxH%) 
s.t. Kx h 0, Tr{Kx) - P, 

where H*^He — H'IjHm is either indefinite or semidefinite. Then Kx is a low 
rank matrix. 

Proof. In order to show that the optimal Kx is low rank, we define a 
Lagrangian which includes the power constraint, and show that this yields no 
solution. From there, we can conclude that the optimal solution is on the 
boundary of the cone of positive semi-definite matrices, namely matrices of 
rank r < n. 

We thus define the following Lagrangian: 

logdet(I„^, + HmKxHIj) - logdet(I„^ + HeKxH^) - ATr(Xx), 
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and look for its stationary points, that is for the solution of the following equa- 
tion: 

Vkx (logdct(I + HmKxHIi) - logdet(I + HeKxH*^) - ATr(ifx)) = 
^ {{HIjHm)-^ + Kx)-^ = {{H*eHe)-^ + Kx)-^ + AI„. 

(4) 

By pre- multiplying the above equation by {K x + iH'^.j H m) ^) and post-multiplying 
it by {Kx + {H*eHe)-^), we get 

{H*eHe)-'+Kx = {HIjHm)-^+Kx+\{{HI,Hm)-^+Kx){{H*eHe)-^+Kx), 
or equivalently 

{{H*eHe)-' - {HIjHm)-')j = {{H*mHm)-' + Kx)m*EHE)-' + Kx). (5) 

Now, we have by assumption that H^^Hm >~ 0„ and H^He >- 0„. If further- 
more Kx >- 0, then aU the eigenvalues of {{HI.jHm)~^ +Kx){{H*^HEy^ +Kx) 
are strictly positive (see Lemma[2l in Appendix). This implies that ^ can have 
a solution if and only if the Hermitian matrix {{H'^He)~^ — {H'IjHri)~^)j is 
positive definite. This means that either H^Hm >- H^He and A > 0, or 
HIjHm -< H'^He and A < 0. This gives a contradiction if HIjHm — H'^He is 
either indefinite or semidefinite, implying that Kx has to be low rank. 



3 Proof of the Converse 

The goal of this section is to prove the converse, namely 

Theorem 2 For any sequence of (2"^= , n) codes with probability of error Pe < e 
and equivocation rate Rs — e < Re for any n > n{e), e > 0, then the secrecy rate 
Rs satisfies 

R, < max log det(I + HmKxHIj) - log det(I + HeKxH*e). 

KxhO,Tr{Kx)=P 

The proof is done in three main steps, that we briefly sketch before entering 
into the details. 

First (subsection [XT]) J we have, similarly to [71[S] that 

Rs-e< -[/(X";r"|Z")-t-(5], e,<5>0. 
n 

Thus, all the work consists of finding an upper bound on I{X;Y\Z). We will 
prove the following upper bound: 

I(X:Y\Z) < max i(X;Y\Z), 

KxhO 
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where 



IiX;Y\Z) = \ogdct\ln + {Hl,, H*^)[ V / ) { "u' ) 



-\ogdei{l + HEKxH*E) 

and A is an um x matrix which denotes the correlation between Vm and 
Ve- At this point of the proof, the converse can be proved for the two "simple" 
cases when H^^Hm > H^He and H'^He >- H^Hm, which are the cases when 
the channel is degraded. 

In general, Vm and Ve are independent. However, since the secrecy capacity 
does not depend on A, we can assume that I{X; Y\Z) is a function of both A 
and Kx for the purposes of tightening our upper bound . We show (subsection 
13. 2p that I{X; Y\Z) is actually concave in Kx and convex in A. As a result, we 
obtain a new upper bound 

I(X;Y\Z) < max i(X;Y\Z), 

KxhO 

for all A such that I — AA* >- 0„g, thus 

I(X;Y\Z) < min max 

= max mm i(X;Y\Z). 
KxhO A 

Furthermore, we jointly optimize I{X : Y\Z) over Kx and A, and compute the 
optimal A in closed form expression, while showing that the optimal Kx is on 
the boundary of its domain, namely, Kx is low rank. 

We conclude the proof (subsection l3.3p by showing that the converse matches 
the achievability. 

3.1 Bound on I{X;Y\Z) and result for the degraded case 

We start by recalling a standard result, which has already been proved in [7l|5]. 



Lemma 1 Given any sequence of (2"^= , n) codes with Pe < £ and Rg — e < Re 
for any n > n{e), e > 0, the secrecy rate Rg can be upper bounded as follows: 

Rs-e<-[I{{X",Y"\Z'') + S], 
n 

for e,5 > 0. 

We thus focus now on finding an upper bound on I{X:Y\Z). We provide 
two approaches: 

1. An upper bound is given by assuming that the legitimate receiver knows 
both his channel and the one of the eavesdropper. 
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2. The same upper bound can also be obtained as follows. Clearly, I{X; Y\Z) 
is upper bounded by taking the maximum over all input distributions 
V{X): 

I(X:Y\Z) < ma.xI(X;Y\Z) = max i(X;Y\Z), 

V(X) KxtO 

where I{X;Y\Z) denotes the value of I{X;Y\Z) when V{X) is optimal. 
We will prove that the optimal distribution is Gaussian. 

Proposition 3 We have the following upper bound: 

I{X- Y\Z) < ma^K.to logdet |^I„ + [HI,, ^b) ( f ) { h'e ) ^"^j 

-\ogdetiI + HeKxH*e), 

where A denotes the correlation between Vm and Ve and satisfies I — A A* >~ 0. 

Proof. An upper bound on I{X:Y\Z) is obtained by assuming that the 
legitimate receiver knows both its channel and the one of the eavesdropper. In 
this case, the capacity of the link between the transmitter and the legitimate 
receiver is that of a MIMO system, namely 

maxlogdet (l„ + (H;,, H^) ( ( ) A^.) . 

Now the channel we consider is degraded, and an upper bound is thus the 
difference of the two capacities, which yields the result. 
We now provide the alternative proof. Clearly 

I{X\Y\Z) < max/(X;r|Z), 

where 'P{X) denotes the input distribution. Now note that 

I{X;Y\Z) = h{Y\Z)- h{Y\X,Z) 

= h{Y\Z)-h{X,Y,Z) + h{X,Z) 

= h{Y\Z) - h{X) - h{Y, Z\X) + h{X) + h{Z\X) 

= h{Y\Z)-h{VE,VM) + h{VE). 

Thus the optimization problem we have to solve is 

max h{X + ^ + Ve) - h{X + Ve). 

V(X) 

Using PropositionfTUl fsee Appendix), the optimal is given by choosing X Gaus- 
sian. Thus we have that 

I{X;Y\Z) - h{Y\Z)-h{VE.VM) + h{VE) 

= h{Y,Z)-h{Z)-h{VE,VM) + h{VE). 
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which, when X is Gaussian, is given by 

\ogdei{KYz) - logdet(i^z) - logdet(XA/£;) + logdet (Kb) 

where Kyz, Kz, Kme and Ke = Iue are covariance matrices, with 

_ f HmKxH^j + In„i HmKxH^ + A 
y HeKxHIi + A* HeKxH*e + 

where A denotes the correlation between Vm and Ve, and 

IriAf A 



A* IjiE 

In order for Kme to be well defined, A has to satisfy I — AA* ^ 0. 
Thus we have 

logdet (^(^ f ) + ( ) ^^)) ~ logdet(i/Bi^xff|: + 1) 

-log det{KME) 

= logdet (^I + ( f ) ( S ) ^^)) " logdet(i/£Xxi/|; - 

where the second equality is well defined if we further require I — AA* >- 0. The 
value of I{X; Y\Z) when X is Gaussian is denoted by I{X; Y\Z): 



I{X-Y\Z)^\ogdei\l+{Hl,, H*^)^ ^ j i^x 

-logdet(I + i/Bi^xi?|;). 



I A \ ^ f H 



(6) 



We can now conclude the proof of the converse for the "simple" cases when 
HIjHm >~ H'^He or H'^He >~ Hl^Hj^i. 

Proposition 4 1. If HljHuj >- H*^He, we have that 

I{X- Y\Z) < max logdet(I + HmKxHIi) - logdet(I + HeKxH*^). 

Kx ho 

2. Vice versa, if H'^He >~ H^jHm, we have that 

I{X;Y\Z) = 0. 

Proof. Let us first compute another way of writing I{X; Y\Z), as defined in 
Note the following factorization: 

f I„„ A \ f I A \ f I-AA* \ / I 



A* V I / V I / V .4* I 
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so that 

f 1 A\'^_f I 0\/(I-AA*)-i 0\/l -A\ 
\ A* I ) ^ \ -A* I ) \ I ) \ I ) 

and we have that 

(HI,, H*e) ( j ) {h'e)^ {HIj-H*eA*)0-~AA*)-\Hm-AHe)+HIHe. 
Thus 

i{X-Y\Z) = \ogdct{I+ {{HIj-H*eA*){I-AA*)-\Hm-AHe)+H*eHe)Kx) 
^\ogdet{I + HeKxH*e). (7) 

1. Since the secrecy capacity does not depend on the noise correlation A, and 
that 

I{X;Y\Z) < max/(X;r|Z), 

Kx 

for all A such that I-AA* >- 0, we are free to take A* = He {HIjHm)~^HIj. 
Indeed, such A does not depend on a choice of Kx-, and since HIjHm >- 
H^He, a satisfies 

l^AA* ^1- Hm{HIiHm)-^H*eHe{HI.iHm)-^HIi >■ 0. 

FinaUy, we are left to show that by replacing A* with H e{HI^jH m)^^ 
in i{X- Y\Z) indeed yields logdet(I+i?A/^xi?M)-logdet(I+i/£:Xxi?|;)- 
Consider thus I{X\ Y\Z) as defined in ([7]). It is enough to show that 

{HI, - HIA*){1 - AA*)-\Hm - AHe) + H*eHe - HIjHm- 

We have that 

{1-AA*)-^ 

= {\ - Hm{HI.iHm) ^HeHe{HI.jHm) ^Hli) ^ 

= ^ + Hm{HI.iHm) ^{{HeHe)^^ — {HIjHm)^^) ^{H'mHm) ^-^1/ 

using the matrix inversion lemma, so that 

HIj{1-AA*)-^Hm = HIjHm + {{H*eHe)-'-{HIjHm)-Y' 

= HIjHm + (i — HeHe{HI^Hm) ^) ^HeHe 

and finally 

(I - H*eHe{HIiHm)'^)HIi{1 - AA*)-^Hm{1 - {HI,Hm)-^H*^He 

— J^M^M — n^tlE- 
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2. Similarly if H^He >~ HI^Hm, wc arc free to choose A* = He{H*^He) ^H^i, 
which satisfies 

I - = I - Hm{H*eHe)-'HI; >- 0. 
Since H^j - H^A* = 0, wc see from (O that 

i{X;Y\Z)^0. 



The cases described in the lemma can be understood as a simple general- 
ization of the scalar case, since those are the degraded cases. When HIjHm >- 
H'^He, all links to the legitimate receiver are better, and the capacity is given 
by the difference of the two capacities, while if H'^He >- HIjHm, then all links 
to the eavesdropper are better, and thus no positive secrecy capacity can be 
achieved. 

We are now left with the case when HI,jHm — H'^He is indefinite, which is 
the non-degraded case, and thus the interesting case to understand. 

3.2 Minimization over A and maximization over Kx 

We have shown in Proposition [3] that 

IiX-Y\Z) < max i(X:Y,Z). 

KxhO 

Since this is true for all A such that I — AA* ;^ 0, we further have that 
I(X;Y\Z) <mmmaxi(X;Y,Z). 

A Kx 

To understand this double optimization, we start by analyzing the function 
i{X;Y,Z). 

Proposition 5 The function I(X;Y, Z) defined in (0) is concave in Kx and 
convex in A. Consequently, 

minmax/(X;r|Z) = maxmin/(X; FlZ) 

A Kx Kx A 

where Kx and A respectively satisfy 

Tr{Kx) =P, KxhO, I - AA* >■ 0. 
Proof. Recall from ([6]) that i{X; Y\Z) is given by 



logdet ( I„ + {HI,, HI) ( ij^ j [ hI ^ \^\ogdei{l+HEKxH*E) 
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1. Convexity in A. Set 

C := 



I A 
A* I 



D 



Now/(X;F|Z) is of the form logdet(I„j,,j+„^+C^^-D), plus some constant 
term, where D ^ 0. It is known that logdet(C) is concave in C [31 
p. 74]), thus logdet(C^^) = — logdet(C) is convex in C, which imphes 
that logdet(I + C^^D) is convex. Furthermore, it is convex in any block 
of C, thus convex in A. Finally, the set of A such that I — AA* ;^ is 
convex. 

2. Concavity in Kx- Recall from ([7]) that 
i{X;Y\Z) 

- logdet(I + {{HI, - H*eA*){1 - AA*)-\Hm - AHe) + H*eHe)Kx) 
-\ogdet{I + HeKxH*e). 

Set 

B := {HI, ~ H*eA*){I - AA*)-\Hm - AHe) + H*eHe. 
We now have that I{X\Y\Z) is given by 

logdet(I„ + BKx) - log dot (I„ + H*eHeKx), 

with B > H*j^He- 

If we compute the gradient of ([H]) with respect to Kx, we get that 

(B-i + Kx)-^ - {{H*eHe)-' + Kx)-' h 0, 
since B y HI-He- Recall that 



(8) 



(9) 



d{x-^) 



kl 



dX,. 



-{X ')ki{X ^)j7, 



so that the derivative of := {{H*^He) ' + Kx) ^ is a x matrix 
given by 

/ —^"^"11 —FF12 ■ ■ ■ —FFin \ 

— FF21 —FF22 ■ ■ ■ ~FF2n 



\ — F Fjii — F Fji2 — F Fji.,1 j 

= ~{{H*eHe)-' + Kx)-' ® {{H*,,He)-' + Kx)-\ 

To check the concavity in Kx , we are thus left to check that 

{{H-^He)-'+Kx)-'®{{H*eHe)-'+Kx)-' ^ {B-'+Kx)-'®{B- 
which is true by 



VKx) 
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3. Since we have shown above that I{X\ Y\Z) is concave in Kx and convex 
in yl, we have that 

mmma.xI(X;Y\Z) = maxmmi(X;Y\Z). 

A Kx Kx A 



From the previous steps of the proof, we now know that 
I{X;Y\Z) < maxniin/(X;r|Z). 

Kx A 

We next compute the minimization over A. Note that we can write I{X; Y\Z) 
in an ahernative way. Recall that 

i{X; Y, Z) = \ogdet{KYz) - logdet(i^z) - \ogdet{K me)- 

By simplifying the Schur complement of det{KYz) with dei{Kz) = dct{Kx + 
Ke), we get that I{X; Y\Z) is given by 

logdetiHM KxHli + - {HmKxH*e + A){HeKxH*e + I)-\HeKxHIj 
- logdet{In,, ~AA*). 

Proposition 6 Let A* be a local minima of I{X;Y\Z). Then 

A* = {He{HI,Hm)-^HI,V,He[H*eHe)-^HIiW){V,W)-\ 

where W is an arbitrary tim x matrix, Q < m < um, and V is an um x 
{nM — "m) matrix, such that 

V 



He{HIiHm) HIjV 

is an invariant subspace of the matrix M , as defined in ill]) . In particular, 
if m — Um, then A* — H e{H'^H e)~^ H'^j . Similarly, if m — 0, then A* = 
HE{HljHMy^Hlj. 

Proof. Let Mi, M2, M^, X be square complex matrices. Set 

f{X) = Ml -{X + M2)M3{X* + A/*). 
It can be shown that 

Wx logdet(/(X)) = -f{X)-\X + M2)M3. 
Using this formula, we compute that 

^A'i{X;Y\Z)^0 ^ f{A){A*+HEKxHlj)-\HEKxH*E+l)^{l-AA*){A*y 
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where 

fiA) = HmKxHI,+1-{HmKxH*e + A){HeKxHI + T)-\HeKxHI, + A*). 

This yields the following nonsymmetric algebraic Ricatti equation 

A*{HmKxHI, + 1)-^HmKxH*eA* + A*[{HmKxHIi + I)-^] 

+ [^HeKxH*e - I + HeKxHIi{HmKxHIi + I)-^ HmKxH*e]A* 
+HeKxHIi{HmKxHIi + - 0. 
One way of solving an algebraic Riccati [4J of the form 

= M21 + M22A* - A*Mii - A*Mi2A*, 

is to look for invariant subspaces of 



M = 



Here we have that M is given by 



A/11 M12 
M21 M22 



-{HmKxHIi + -{HmKxHIj + 1)-^HmKxH*e 

eKxHIj{HmKxHIi + -HeKxH*^ - I + HeKxHIi{HmKxH*j + ly^HuKxh 

(11) 

Set 

p _ ( HmKxHI^ + I„j,,j 

We have that F{M + I„j,,,+„g) is given by 

HmKxH%i —HmKxH^ 
HeKxHIi{HmKxHIj + -HeKxH*e + HeKxHIi{HmKxHI, + I)-'HmKxH% 

It is easy to see that 

= ( -He + HeKxHIAHmKxHI,+I)-^Hm ) (-^^Hl^KxH^) 

which implies that —1 is an eigenvalue of M. Thus a first invariant subspace is 
given by the eigenspace associated to —1, which is the kernel of M + I, or in 
other words, the subspace orthogonal to {—KxHl^, KxH*^): 



Ker{M + 1) 



He {H'^He ) ^ ^ H^j Ui 



for any Ui. Let us now look for the second invariant subspace. We first rewrite 
M as 

^^ = ^"' ( -He + HeKxHI~{HmKxHI, + I)-^Hm ) i-KxHl,,KxH*Eyi. 
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We now show that 

C/2 



He{HI.jHm) ^HIjU2 
is an invariant subspace for any C/2- Indeed, we have that 



F- 



—He + HeKxHI.j{HmKxHI,j + 1)~^Hm 

— (HmKxHI^ + I)~^Hm 
He + HeKxHI,j{HmKxHIj + I)^^Hm 

u (u* u ] {HmKxHIj + \y^H^ 



M 



Since 



-He{1 - KxHIj{HmKxH;j + 1)-^Hm) 

-He{{HI,Hm)-^HIj{HmKxHIj + I) - KxHI,){HmKxHIj + 1)-^Hm 
-He{HIjHm)~^ Hly[{HMKxHlj +1) ^Hm- 



Thus, a Jordan basis of A/ is given by 

IriM Ins 

Finally, solutions of the Ricatti equation are given by [1] 

A* - iHEiHljHM)-'HljV,HEiH*EHE)-'HljW){V,Wy\ 

where W is an um x rn matrix, < m < njv/, and F is a um x nM — tn matrix, 
such that 

( 

\ He^HIjHm) ^H'IjV J 

is an invariant subspace of M. Note that W can be chosen arbitrary since 
{1,He{H*^He)'^HIi) is the eigenspace associated to —1. 

■ 

Proposition 7 Let Kx be an optimal solution to the optimization problem 

m&xKx mmAi{X;Y\Z) 

s.t. Kx h 0, Tr{Kx) = P, 

where A* = {HEiHliHuT^ HljV, HE{H%HEy^W){V,W)~^ is the optimal 
solution for the minimization over A. Then Kx is a low rank matrix. 

Proof. We have seen in ([5]) that I{X; Y\Z) can be written 

logdet(I + BKx) ~ logdet(I + HeKxH*^), 
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where 

B := (Hli - H*EA*)iI - AA*)-\Hm - AHe) + H*eHe. 
Using the matrix inversion lemma, we have that 

B-i = {H*eHe)-^ - {H*EHEr\Hl, - H*eA*)- 
(I _ AA* + {Hm - AHe){H*eHe)-\HIj - H*eA*))-\Hm - AHe){H*eHe)-\ 

so that 

s-i - {h%He)-' - -{h*eHe)-\hi, - H*eA*)- 

(I _ AA* + {Hm - AHe){H*eHe)-\HI, - H%A*))-^{Hm - AHe){H%He)-' . 
Now 

{H*eHe)-HHIi - H*eA*) 
- {H*eHe)-^[HIi - H*e{He{H*mHm)-'HIiV, He{H*eHe)-^W){V, W)-^] 
^ [{H*eHe)-^HIj{V, W) - {{HIjHm)-'HIjV, {H*EHEr'W)]{V, W)-' 
= m*EHE)-' - iHl,HM)-')Hl,V, 0){V, Wy' 

thus {H^HEy^iHlj - H*j^A*) is low rank and consequently B-^ - {H*j^He)~'^ 
is. 

Now, from Proposition[2l we know that either B~^ -< [H'^H e)~^ and A > 0, 
or B~^ >~ {H'^He)^^ and A < 0. This gives a contradiction since 
{H'^He)^^ , implying that Kx has to be low rank. 



Proposition 8 Knowing that the rank of Kx is r < n, the optimal solution to 

min/(X;r|Z) 

is given by 

A* - {HEiHljHM)-'HliBHMUxV,HEiH*EHEy'H*MW)iBHMUxV,W)'' 
where Kx = UxU*x and B = {HmKxHIj + l)-^ . 
Proof. The Jordan decomposition of M is now given by 

f He{HI,jHm)^^HIj He{H'^He)^^ ) 
\^ He{HIjHm) ^HIj He{H^He) ^HIj J \ -1 J 

where 

J — —{HmKxHm + I) ^{HmKxH*eHe{H*mHm) + I). 
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Let us now look more carefully at J. We first show that when Kx is low rank, 

— 1 is an eigenvalue. Indeed, we have 

-{HmKxHI, + 1)-\HmKxH*eHe{HIjHm)~^HI, + I) + I 
= -{HmKxH*m + 1)-^HmKx{H*eHe{HIiHm)-^ - l)Hli. 

This is enough to show that —1 is an eigenvalue since det(_ft'x) = by assump- 
tion that Kx is low rank. The above computation also tells us that 

— {HmKxHI^+T) ^{HmKxH'^He{HIjHm) ^HIj+T) 
= -{HmKxHIj + 1)-^HmKx{H*eHe{HIjHm)-^ - \)Hlj - I. 

Since Kx is low rank, it can be factorized as Kx = UxU^ where Ux is a n x r 
matrix, if r < n denotes the rank of Kx- Clearly, {HmKxHm + ^)^^ H mU x is 
an invariant subspace of J. A Jordan basis is thus given by 

P = ( {HmKxHI, + 1)-^HmUx Q ) 

where Q is the eigenspace associated to —1. Set B := {HmKxHI^ This 
thus gives us a more precise Jordan basis for M (as defined in (|11[) ). namely 

P I 

He{HIjHm)-^HIiP He{H*eHe)-^HIj 
BHmUx Q I 

He{HIjHm)~^HmBHmUx He{HIjHm)^^HIjQ He{H'^He)^^H'^j 

In this decomposition, the third block is the eigenspace of —1 of dimension um 
which is always present. The middle block also corresponds to an eigenspace of 

— 1, of dimension um — this one appearing only when Kx drops rank. The 
first block is an invariant subspace, corresponding to the r eigenvalues of M 
that are different from —1. 

From this Jordan basis of M, we have that 

A* = {He{HIiHm)-^HIiBHmUxV,He{H*eHe)-'H*mW){BHmUxV,W)-' 

is a solution of the Ricatti equation, where W is any um x ('t-m ~ ^) matrix, 
and V is any r x r matrix. 



3.3 The converse matches the achievabihty 

So far, we have solved the optimization problem 

minmaxJ(X;r|Z) 

A Kx 

by computing the optimal A in a closed form expression, and by showing that 
the optimal Kx is low rank. We are now ready to conclude the proof, by proving 
that the optimal A makes the converse match the achievabihty. 
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Proposition 9 Set B = {HmKxHIj + and let 

A* = {He{HIiHm)-^HIjBHmUxV,He{H*eHe)-^H*mW){BHmUxV,W)-^ 
he a solution of the Ricatti equation. Then 

i{X; Y\Z) = logdet(I + HmKxHIi) - logdet(I + HeKxH*^). 
Furthermore, there exists V, W such that I — A A* >- 0. 
Proof. Recall from ^ that a way of writing I{X; Y\Z) is 



-1 



I A\ ( Hm 



logdet \I+ {Hit, H*e)[ J 1 ( ]Kx]~logdetiI+HEKxH*E). 



where 



Thus 



so that 



( I 


A ^ 








I A* 






, A* 





I \ / I .4 
1- A*A I [ I 



A* I ) ~[o I/lo {I-A*A)-^ ) [ -A* I 



{HI,,H*e) [ I* i) ( ]=HI,Hm+{-HI,A+H*e){1-A*A)-\-A*Hm+He) 
and 



I{X- Y\Z) = logdet(I + HIjHmKx + {-Hl[A + H*e){1 - A*A)-\-A*Hm + He)Kx) 

-logdct{l + HeKxH*e). 

We now show that Kx is in the kernel of —A*Hm + He- We have that 

(BHMUxV^Wy'HMKx = {HmUxV,B-'W)-^B-^HmKx 

= {HmUxV, B-^W)-^HmUxU*x{HIjHmKx + I) 
{ V-'U*x{HIjHmKx+I) 

so that 

A* HmKx = He{H'mHm) ^Hl^BHMUxUxiHljHMKx + 
= He{H'^jHm) ^H^jBB ^HmKx 
= HeKx, 

and thus A*HmKx — HeKx, so that we get 

i{X; Y\Z) = logdet(I + HmKxHIj) - logdet(I + HeKxH*^) 
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We now have that 
I - AA* y 



^*^*'w*^HM{H%Htr^^^^^ ) iHE{Hl,HM)-'Hl,BHMUxV,HE{H%HE)-^Hl,W) h 

V*UxHIjB(I — Hm{HIjHm)~^H^He(HIjHm)~^HIj)BHmUxV 

W*{I- Hm{H%He)-^HIj)W 

since 

V*U*xHl,B*W - V*U^H*mB*Hm{H*^Hm)-^H*mW 
= V*U*x[H*mB* - HIjB*Hm{HIjHm)-^HIj]W = 0. 

To conclude the proof, notice that we have 

I — Hm{HIjHm)~^H%He{H'IjHm)~^HIj < H^Hm ^ H%He 

and 

I — Hm{H'^He)~^H1i -< <^=^> H^He -< HIjHm- 

Thus if H^^Hm — H^He is indefinite, there exists V and W such that the above 
matrix is positive definite. 



^0, 



4 Conclusion 

In this paper, we considered the problem of computing the perfect secrecy ca- 
pacity of a multiple antenna channel, based on a generalization of the wire-tap 
channel to a MIMO broadcast wire-tap channel. We proved that for an arbi- 
trary number of transmit/receive antennas, the perfect secrecy capacity is the 
difference of the two capacities, the one of the legitimate user minus the one of 
the eavesdropper. 



Appendix 

Proposition 10 Let A,B he circularly symmetric complex jointly Gaussian 
random vectors with strictly positive definite covariance matrices. Let X be 
a random vector independent of A and B, and S be a positive definite matrix. 
The optimal solution to 

maxP(X) h{X + A,X + B)-h{X + B) 
s.t. Tr{Kx) = P 

is Gaussian. 
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Proof. First note that 

^J\X + BJ [V2X + ^{A + B) )■ 
Since multiplication by a unitary matrix does not change the entropy, 
h{X + A,X + B) 
. /,(v^A- + -L(A + B),-L(^-B)) 

= h(^V2X + ^iA + B)l^{A-B) 

= h{V2X + U) + h(^^{A~ B] 

where U is Gaussian with covariance matrix Ku given by 

lE[{A + B){A + B)*] - lE[{A + B){A - B)*]- 
E[{A - B){A - B)*]-^E[{A - B){A + B)*], 

using conditional Gaussian distribution. 
To maximize 

h{X + A,X + B)- h{X + B), 
we thus need to maximize 

h{^/2X + U) -h{X + B), 

or equivalently 

h{X + U') - h{X + B) 

where U' = is Gaussian, independent of X. The optimal distribution of 

such expression has been shown to be Gaussian by Liu and Viswanath [15] in 
the case of real Gaussian vectors. Their result can be readily extended to the 
circularly symmetric complex Gaussian case. 



Lemma 2 If A = A* and B = B* >- 0, then the matrix AB has all positive 
eigenvalues. 

Proof. Since yl ^ 0, we can write A = ^^/^(A*)^/^ with A^/^ invertible. 
Therefore, 

has the same eigenvalues as the matrix {A*y/^BA^/'^, which is positive definite. 
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